fix pi0 prepare_language will raise an error if the task is a string by captainfffsama · Pull Request #1625 · huggingface/lerobot

captainfffsama · 2025-07-30T08:51:23Z

Fix: Handle string-based task inputs during Pi0 inference

Description:
When the task input is a string during inference with record.py, it was incorrectly split, leading to tokenization errors. This PR wraps the string in a list to ensure correct processing, aligning the behavior with smol-vla.

Copilot

Pull Request Overview

This PR fixes a bug in the Pi0 model where string-based task inputs during inference cause tokenization errors. The fix ensures string tasks are properly wrapped in a list and replicated to match the batch size.

Adds type checking to handle single string task inputs
Implements batch size replication for single task strings
Aligns Pi0 behavior with smol-vla model handling

Copilot · 2025-07-30T08:51:57Z

src/lerobot/policies/pi0/modeling_pi0.py

+        if isinstance(tasks, str):
+            tasks = [tasks]
+
+        if len(tasks) == 1:


The condition len(tasks) == 1 will always be true after the previous block that converts a string to a single-element list. This means single tasks will always be replicated to match batch size, even when the original input was already a list with one element that shouldn't be replicated.

Suggested change

if isinstance(tasks, str):

tasks = [tasks]

if len(tasks) == 1:

was_string = isinstance(tasks, str) # Track if the input was originally a string

if was_string:

tasks = [tasks]

if was_string and len(tasks) == 1: # Only replicate if the input was originally a string

Copilot · 2025-07-30T08:51:58Z

src/lerobot/policies/pi0/modeling_pi0.py

+        if isinstance(tasks, str):
+            tasks = [tasks]
+
+        if len(tasks) == 1:


The logic assumes that when len(tasks) == 1, it should replicate the task across the batch dimension. However, this doesn't distinguish between a single string input (which should be replicated) and a legitimate single-element list (which may not need replication). Consider checking the original input type or batch size mismatch instead.

Suggested change

if isinstance(tasks, str):

tasks = [tasks]

if len(tasks) == 1:

was_string = isinstance(tasks, str) # Track if the input was originally a string

if was_string:

tasks = [tasks]

if was_string or len(tasks) == 1 and len(tasks) != batch[OBS_STATE].shape[0]:

AdilZouitine · 2025-07-30T12:20:27Z

hey @captainfffsama, this issue will be fixed when #1431 #1452 will be merged 😄

fix pi0 prepare_language will raise an error if the task is a string

4a2b396

Copilot AI review requested due to automatic review settings July 30, 2025 08:51

Copilot AI reviewed Jul 30, 2025

View reviewed changes

CarolinePascal added bug Something isn’t working correctly policies Items related to robot policies labels Jul 30, 2025

captainfffsama closed this Aug 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix pi0 prepare_language will raise an error if the task is a string#1625

fix pi0 prepare_language will raise an error if the task is a string#1625
captainfffsama wants to merge 1 commit intohuggingface:mainfrom
captainfffsama:fix_pi0_prepare_lang_captain

captainfffsama commented Jul 30, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jul 30, 2025

Uh oh!

Copilot AI Jul 30, 2025

Uh oh!

AdilZouitine commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

captainfffsama commented Jul 30, 2025

Fix: Handle string-based task inputs during Pi0 inference

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jul 30, 2025

Choose a reason for hiding this comment

Uh oh!

AdilZouitine commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants